The assessment of cognition/intelligence in infancy

نویسنده

  • Terri L. Shelton
چکیده

This article will review the development of infant cognitive assessment and will describe selected tests. Considerations in choosing, administering, and interpreting the results of infant intelligence/cognitive assessment instruments will be outlined. Finally, the usefulness of cognitive assessment will be discussed along with new approaches to assessment. Article: The cognitive assessment of infants, like intelligence testing in general, has its roots in the intelligence testing movement of the late 19th and early 20th centuries. Since that time interest in the development and use of infant cognitive assessment instruments has mushroomed. The passage of Public Law 99457 in October of 1986, the Education of the Handicapped Act Amendments, established early intervention services for infants from birth through 3 years of age in all states. This legislation also has contributed to an even greater interest in identifying techniques that can be used to document developmental delay, to identify infants at risk, to plan intervention services, and to evaluate their effectiveness. In light of this growing interest, this article will review the development of infant cognitive assessment and will briefly describe some of the tests in current use. Basic considerations in choosing and administering infant intelligence/cognitive assessment instruments will be discussed along with a review of factors that affect the interpretation of the results of these evaluations. Finally, the usefulness of cognitive assessment will be discussed, including the use of information in an interdisciplinary team, the predictive validity of tests, and new approaches to assessment. HISTORIC OVERVIEW Psychometric instruments Arnold Gesell' was the pioneer of infant assessment. Although most infant cognitive assessment instruments that were later developed included many of the items developed by Gesell, his primary purpose in developing the scales was to identify those infants in need of assistance in the area of welfare and hygiene. The initial scales, published in 1925, 1 consisted of 144 items in four general areas: motor, language, adaptive, and personal/social behavior. The purpose of the scales was broad and clinical. An updated version of the scales was published in 1974 by Knobloch and Pasamanick 2 ; it provided more adequate norms for infants from 1 month to 5 years of age. Although Gesell viewed the environment as important, he thought that development was primarily influenced by genetics and the maturational process. While the scales do not yield a specific level of cognitive abilities, but rather developmental quotients in the different areas, the Gesell scales provided the basis for many other assessment instruments, such as the Brunet-Lezine Scale, 3 published in 1951 and largely used in Europe, and the Griffiths Scale of Mental Development, published in 1954. 4 In contrast to Gesell's broad conceptualization of development, Canal 5 specifically set out to develop a standardized assessment of infant cognitive abilities that could be a downward extension of the Stanford-Binet. While Cattell used many of the items from the Gesell, she eliminated those thought to be unduly influenced by home training or large muscle control. She made the scoring more objective and standardized the test for infants from birth through 30 months. Like Gesell, the theoretical underpinnings of the test seemed to emphasize the inherent abilities of the infant and to diminish the role of the environment. Bayley 6,7 also used the Gesell schedules as a starting point in her first California First Year Mental Scale, published in 1933. In its present form the scales include both mental and motor scales and a behavior record for evaluating infants aged 1 month to 30 months. The Bayley scales are perhaps the most widely used of the formal infant intelligence scales, and there is a large body of research documenting the reliability and validity of the scales. Although initially the development of the scales seemed to reflect a more unitary view of intelligence, over time Bayley's concept of intelligence began to change. Intelligence, as measured by these scales, seemed to be viewed as emergent throughout infancy and functionally unique at different ages. In fact, Kohen-Raz's 8 scalogram analysis of the Bayley scales yields five scales that are very similar to the scales typically found on Piagetian-based sensorimotor instruments. Sensorimotor scales With the translation of the work of Piaget into English came a new theoretical approach to the study of infant development and to the assessment of cognition/intelligence in infants as well. The previous approaches by Bayley, Cattell, and Gesell assumed that intelligence was more or less a unitary trait and that development was nonhierarchical. In contrast, Piagetian theory viewed development as a series of hierarchical, qualitatively different stages, dependent on the infant's interaction with his or her environment. Three major assessment scales based on Piagetian theory have been developed: the Casati-Lezine Scale, 9 the Albert Einstein Scales of Sensorimotor Development by Corman and Escalona, 10 and the Infant Psychological Development Scales (IPDS) by Uzgiris and Hunt. 11 The Casati-Lezine test measures object search, the use of intermediaries, object exploration, and the combination of objects. The Einstein scales are designed for infants between 1 month and 2 years of age and assess skills in prehension, object permanence, and functioning in three-dimensional space. The IPDS is appropriate with infants aged 1 month to 2 years and is comprised of six scales: (1) visual pursuit and permanence of objects, (2) means of obtaining desired environmental events, (3a) vocal imitation, (3b) gestural imitation, (4) operational causality, (5) construction of object relations in space, and (6) schemes for relating to objects. In contrast to non-Piagetian-based scales, no developmental quotients or deviation IQ scores are obtained. Rather, infants are characterized with respect to their most advanced performance on each subscale. The IPDS appears to be as reliable and valid as the other psychometric approaches. Although the IPDS has not really been standardized, Uzgiris and Hunt' note that the various levels of cognitive organization are of psychologic significance in their own right and "need not be based on the individual's comparative status in a statistical distribution." 11px Concurrent with the development of Piagetian-based instruments in the 1960s and 1970s was the development of assessment instruments in several other areas: (1) neonatal assessment, (2) screening instruments, (3) assessment of skills for special populations, and (4) assessment-based programming instruments. Clearly the impetus for many of these instruments has been and will continue to be the growing interest in early intervention techniques, the need to identify infants at risk, and the survival of infants born prematurely or with disabilities. A brief description of selected and frequently used instruments in each of these categories appears in Appendix A. Appendix B is a checklist for assessing infant cognition. Further accounts of the historic development of infant assessment as well as current trends can be found elsewhere. 12-14 BASIC CONSIDERATIONS IN INFANT ASSESSMENT Purpose The assessment of infant cognition or intelligence can be useful in a variety of ways: to document delay or risk status, to design an intervention program, to provide valuable pre-treatment and posttreatment information for research or treatment regarding the effectiveness of a therapeutic intervention, or to predict future abilities. Therefore it is extremely important that an examiner first identify the function or purpose of the assessment. The purpose will determine not only the type of test (eg, screening, in-depth assessment) but also the necessary test characteristics (eg, time needed, reliability). Once the examiner's purpose is clear, it also is important to examine the purpose for which the assessment instrument was designed and to try to match these as much as possible. As measurement often deals with intangible variables that are not always directly observable, constructs such as cognition, intelligence, and personality must be measured indirectly. Each test author makes theoretical assumptions about this variable or construct. Information on the theoretical underpinnings of the particular test as well as other test characteristics, such as item selection and reliability and validity estimates, should be available in the test manual's description of the construction of the test. Skills of the examiner In choosing an assessment instrument, the examiner should evaluate whether he or she has the necessary training and experience to administer the instrument. Because of the special demands inherent in evaluating infants, examiners should have experience testing very young children and infants in general. For example, it is important to know techniques for eliciting the best performance from the infant, to make the family feel comfortable, and to know when to interrupt or to discontinue testing. In addition to general experience, experience with specific populations is equally important to recognize the limitations of infant assessment, to interpret the test results, and to use the information to assist caregivers or for intervention. For example, specific experience in evaluating infants with cerebral palsy or with sensory impairments and in interpreting test results is critical, particularly in light of the fact that few existing infant assessment techniques have been developed specifically for infants with these disabilities. Familiarity with the specific assessment instrument is required as well. Because the attention span and interest level of infants are brief, a good working knowledge of the assessment instrument can speed the administration of the items, maximizing the chance that the infant's optimal performance is obtained. Many of the assessment instruments specify the level of expertise needed. For example, the Bayley Scales of Infant Development 7 should not be administered by anyone who is not trained in developmental or intellectual testing. In contrast, the Hawaii Early Learning Profile (HELP) 15 can be administered by an early interventionist who needs no particular training in test construction or developmental testing per se. At a basic level, however, all examiners should know the limitations of the testing instrument, how representative the infant's performance was, and how well they administered the test so that these factors can be considered in the interpretation of results. The professional discipline of the examiner can vary. There are psychologists, nurses, early interventionists, special educators, among others, who can be competently trained examiners of infant cognitive abilities. However, the ability to interpret the results and to communicate these findings to the infant's parents and to other professionals is as important as the ability to choose and to administer the test. Because of these reasons it is often advisable to choose a developmental or clinical psychologist or other professional with specific knowledge of test construction and limitations of intellectual testing and with the clinical skills necessary to interpret and relate in a clear but compassionate way what may be distressing news to the family. Because the results of a cognitive assessment are important for other developmental areas, the examiner should also have a working knowledge of other develop-mental areas so that the results can be communicated to other professionals, such as a physical therapist or physician, in a meaningful way. Standardization sample In addition to matching the purpose of the assessment to the theoretical underpinnings of the test itself, it is important to determine whether the characteristics of the child being evaluated are similar to those of the group for whom the test was developed or standardized. Standardization is a mechanism for attaching meaning to raw scores and for making comparisons between and among individuals. Sometimes it is not possible to find an assessment instrument whose standardization sample is comparable to the characteristics of the infant being evaluated, especially when evaluation of infants with chronic illnesses or multiple disabilities is becoming more common. Nevertheless, when these infants are being evaluated, this factor should enter into the interpretation of the results. Performance Another consideration is the type of performance one is interested in obtaining. Cronbach 16 distinguished between measures of maximum or best performance and measures of typical performance. For example, the Brazelton Neonatal Behavioral Assessment Scale 17 is designed to examine the infant's best performance, and specific conditions are outlined that must be met before the infant's performance is considered optimal. Measures of typical performance, such as tests of personality, habits, or interests, are designed to examine an individual's usual, representative, or typical behavior. Psychometric issues Psychometric issues, such as reliability and validity, should enter into the choice of an assessment instrument. Reliability refers to the consistency and accuracy of measurement, that is, what proportion of an infant's score can be attributed to chance or error and what proportion is a "true score." There are a number of ways to estimate reliability (eg, test-retest, alternate forms, split half, and coefficient alpha). For example, if an infant is to be reevaluated in the near future, the test-retest reliability of a particular instrument would be an important consideration. A test with good test-retest reliability would give the examiner greater confidence that the results obtained on the second evaluation were a valid representation of the infant's abilities and not reflective of error or chance fluctuations in the test itself. All estimations of reliability are reported in the form of a correlation coefficient (eg, Pearson r) and range from 0 to 1.00. The closer a reliability coefficient is to 1.00 the better. However, because some degree of chance or error enters into every assessment, a reliability of 1.00 is rarely obtained. Reliabilities between .70s and .90s indicate that the test is generally stable and accurate. While good reliability is an essential characteristic of an assessment instrument, the information it provides is useless if the test lacks validity. Validity is the relationship between the variable or trait being measured and the procedures used to measure it. In other words, does the test measure what it says it does? There are three major types of validity: content, criterion-related, and construct. Content validity is whether the test covers the material it is supposed to cover. Criterion-related validity, such as concurrent or predictive validity, relates to the comparison of test scores with some external variable or "criterion" known or believed to measure the same variable or attribute. Similar to reliability, the correlation coefficient is the usual index of criterion validity. Unlike the reliability coefficient, the value of the validity coefficient must be squared to determine the percentage of the infant's score that is due to chance fluctuations and the percentage thought to be a true reflection of the infant's abilities. For example, suppose the relationship or correlation between an infant's score on a test of cognitive abilities at 12 months of age and an intellectual assessment at 6 years of age is .70. Squaring .70 results in .49. This number indicates the proportion of variance shared in common by the two tests. Thus 49% of the infant's score is believed to be based on the infant's cognitive abilities; 51% is attributable to error, chance, plus the effects of some other undetermined variables. How high a validity coefficient should be will vary. Obviously the higher the validity coefficient the more confidence the examiner can place on the results of the test being an accurate reflection of the infant's abilities. Given the choice between two measures where everything else is equal, it would be better to choose the one with the higher criterion validity. Construct validity examines the meaning of the test. It is determined by examining the relationship of the test to the variables the test is intended to assess as well as the relationship to those that should have no relationship to the domain underlying the instrument. Construct validity cannot be determined on the basis of any one study but is best demonstrated by an accumulation of supportive evidence from different sources over time. For example, the construct validity of an infant assessment instrument would be determined by demonstrating high positive correlations or a strong relationship with other measures of intelligence and a low correlation or minimal relationship with a measure of a presumably unrelated trait or variable, such as temperament. INTERPRETATION OF TEST RESULTS Criterion v norm referencing Assessment instruments differ on whether test scores are left as raw scores or are translated into criterionreferenced or norm-referenced scores. Raw scores alone provide no stable point of comparison between individuals since scores might indicate what is a normal or expected performance on a particular test. To aid in interpreting the infant's score, the raw score can be translated. In criterion-referenced instruments, the raw score is translated into a statement about how that child's performance compares with a specified behavioral criterion established for that specific test. The principal use of criterion referencing has been in the development of mastery learning tests. The tests are designed to measure whether an individual has or has not attained mastery of a specific content area. The content of these criterion-referenced measures is usually designed to test acquisition of a relatively small domain of content and skills. Age-equivalent scores, mental age, ratio intelligence quotients, and some developmental quotients are examples of criterion-referenced scores. The advantage of these conversions is that they are readily understandable and are suitable when one is interested in whether an infant has acquired a specific skill. However, criterion-referenced tests should not be used if one is interested in comparing an infant's performance to the average performance of a group of infants in general. When this type of information is desired, norm-referenced instruments are more appropriate. Most norm-referenced measures are based on an overview of some broad content domain (eg, intelligence), in contrast to the criterion-referenced tests that examine a specific skill (eg, fine motor). For example, the Early Learning Accomplishment Profile (Early LAP, see Appendix A), a criterion-referenced measure, is designed to provide a record of the child's existing skills in the major developmental areas. The Bayley Scales of Infant Development, 7 a norm-referenced instrument, provides a comparison of an infant's mental and motor abilities with those of other infants. Norm-referenced scores are most commonly reported as percentile ranks, z-scores, tscores, stanines, and deviation intelligence quotients. Correction for prematurity Since an increasing number of the infants whose cognitive abilities are being evaluated have been born prematurely, another consideration in the interpretation of results is the issue of correction for prematurity. There continues to be debate over whether one uses an infant's gestational or chronologic age in the calculation of the scores and how long one uses this conversion. It has typically been the practice to correct for prematurity during the first two years of life. 18-20 However, there is growing evidence that this may not be the most appropriate practice. Caputo et al, 21 Siegel, 22 and others have indicated that correction for the degree of prematurity appears to be appropriate in the first few months or during the first year. However, a slightly more accurate prediction is achieved by using the uncorrected scores. As the Siegel 22 study points out, biologic immaturity alone is not the only consideration. Environmental influences, low birth weight, neurologic insult, among others, influence test scores. Furthermore, the degree of prematurity is a consideration. It might be more appropriate to correct for prematurity longer if the child was born at 28-weeks gestation than if the child was born at 34 weeks. In addition, as the medical conditions under which premature infants are born improve over time, correction for prematurity must be constantly reevaluated. Whatever the correction method used, it is important to discuss with the child's family the child's current rate of development and what rate would be necessary for the infant to "catch up" in the future. Many parents have reported disappointment when an early prediction of catching up was not accurate. The box is an example of how the infant's rate of development can be explained. At the January evaluation this infant was only one month behind his or her gestational age and three months behind his or her chronologic age. If the infant gains one month of skills for each month of life until the next evaluation, he or she will still be three months behind his or her chronologic age, but the relative percentage of delay (75% v 83%) would be less. To catch up with his or her chronologic age by the next evaluation, the infant would need to gain nine months of skills in only six months time—an accelerated rate of development. Sharing this type of discussion with the family can be helpful in setting realistic expectations for the infant's development. Continuing to use a child's gestational age for some time (eg, until 3 years of age) without some type of discussion about the infant's rate of development can place the family at risk for bitter disappointment when the child does not make up all the delay. It can, in some cases, lead the family to be disappointed with what may in fact be a good rate of development (eg, month-for-month gain). Testing situation When choosing an assessment instrument and when interpreting the results the examiner should consider where the child is being evaluated. Evaluation results obtained during an assessment in familiar surroundings such as the child's home may be very different from those obtained on the same child in unfamiliar or potentially disruptive settings (eg, developmental evaluation clinic). These factors are critical in the interpretation of the results for a child of any age but particularly for an infant. The timing of the evaluation also is an important factor. If one has any flexibility in scheduling a cognitive evaluation, it would be important to contact the family to determine when the infant normally naps or feeds. Trying to elicit the infant's interest in a structured task during a regular naptime is less likely to yield the best performance. Since many infants receiving cognitive evaluations may have other health difficulties, possibly as the result of prematurity, the infant's health status should be considered. Was the infant fatigued following several developmental or medical evaluations? Does the infant have an ear infection or some other acute condition that may be affecting his or her performance? While it is not always possible to schedule the evaluation when these factors are not concerns, some mention of them in the interpretation of the results is important. Conferring with other professionals who have ongoing contact with the infant or who may have evaluated the infant earlier in the day or asking the parents if the infant's performance was representative or typical of what they see at home are ways in which the representativeness of the infant's performance can be estimated. Some statement about how valid the test results are thought to be should be included in every developmental evaluation report. Parent's report Asking parents about the representativeness of their infant's performance will automatically lead to the question of whether to include, and how to include, the parent's report in the interpretation of results. Historically, professionals' estimation of the validity of a parent report has been that parents overestimate their child's abilities. As a result a parent's report typically has not been included in the analysis of results or in the report. Furthermore, the differences in perspective are not usually discussed or reconciled, often leading professionals to label the parents as denying the child's true developmental limitations and the parents to discount the examiner's results because they did not account for skills not displayed in the artificial testing situation. There are many benefits for including a parent's report. First, it is a source of important and accurate information about the child's abilities. While referring to a parent report about parent-child interaction, Maccoby and Martin's' comment is applicable to the assessment of an infant's cognitive abilities as well. Using parents as informants has great potential advantages. For assessment of behavior that varies considerably across situations or behavior that is usually not displayed in public, reliable observational data are difficult to obtain and parent inter-views are often the only viable alternative. Parents have an opportunity to observe their children and the patterns of interaction in their families over extended periods of time in a broad range of situations. Thus by virtue of their daily participation in the family system, parents have access to a truly unique body of information about the family, and it is reasonable to tap into this information by questioning them. 23(p16) There is growing evidence that there is not much difference between the parent and the professional perspective, and where different the parents' perspective may be more accurate. For example, in one study by Honzik et al, 24 use of mothers' reports contributed to more accurate findings. Infants who were suspected, based on birth records, of having neurologic impairment vocalized with greater frequency during their cognitive evaluations at 8 months of age than was true of the normal control group. Honzik et al 24 concluded that the infants in the control group were more inhibited by the strangeness of the test situation and vocalized less in the test situation, thus failing the vocalization items on the Bayley. On the Griffiths and the Gesell scales, the mothers' reports of vocalizing would have been credited, thus leading to a more accurate representation of the

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating the relationship between intelligence believe and academic procrastination with mediating role of social cognition.

Introduction: Academic procrastination is one of the important issues that has been considered by many researchers in recent years and it has been mentioned as a bad habit and a behavioral problem and a special type of behavioral procrastination. Many adults experience regular daily activities, so the present study aimed to investigate the relationship between IQ and academic procrastination th...

متن کامل

Assessment of chronic diarrhea in early infancy in Tehran Tertiary Care Center; Tehran-Iran

Introduction Chronic diarrhea of infancy is a heterogeneous syndrome that includes several diseases with different etiologies. The aim of this study was investigating chronic diarrhea, its etiologies, clinical features and outcomes in infancy.Materials and Methods Retrospective study investigating infants hospitalized in the gastroenterology department of Tehran tertiary care center.The main de...

متن کامل

Exploring Emotion Recognition Patterns among Iranian People Using CANTAB as an Approved Neuro-Psychological Assessment

Background: Emotion Recognition is the main component of social cognition and has various patterns in different cultures and nationalities. The present study aimed to investigate emotion recognition patterns among Iranians using the Cambridge Neuro-Psychological Test Automated Battery (CANTAB) as a valid neuropsychological test. Methods: In this descriptive-analytical study, 117 males and fema...

متن کامل

Smog, Cognition and Real-World Decision-Making

Cognitive functioning is critical as in our daily life a host of real-world complex decisions in high-stakes markets have to be made. The decision-making process can be vulnerable to environmental stressors. Summarizing the growing economic and epidemiologic evidence linking air pollution, cognition performance and real-world decision-making, we first illustrate key physiological and psychologi...

متن کامل

Portfolio Assessment and the Enhancement of Higher Order Thinking through Multiple Intelligence and Dialogic Feedback

The current emphasis on higher order thinking skills (HOTS) has inspired many EFL educators to explore the impact of merging different pedagogical teaching and assessment strategies on the enhancement of thinking skills. Responding to such a growing need to investigate the effect of diverse teaching strategies on HOTS, the present study aimed to explore the impact of the integration of portfoli...

متن کامل

Dietary patterns in infancy and cognitive and neuropsychological function in childhood.

BACKGROUND Trials in developing countries suggest that improving young children's diet may benefit cognitive development. Whether dietary composition influences young children's cognition in developed countries is unclear. Although many studies have examined the relation between type of milk received in infancy and subsequent cognition, there has been no investigation of the possible effect of ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010